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Abstract 

Cutting a cake is a metaphor for the problem of dividing a resource (cake) among 
several agents. The problem becomes non-trivial when the agents have different valuations 
for different parts of the cake (i.e. one agent may like chocolate while the other may like 
cream) . A fair division of the cake is one that takes into account the individual valuations 
of agents and partitions the cake based on some fairness criterion. Fair division may be 
accomplished in a distributed or centralized way. Due to its natural and practical appeal, 
it has been a subject of study in economics under the topic of "Fair Division". To best 
of our knowledge the role of partial information in fair division has not been studied 
so far from an information theoretic perspective. In this paper we study two important 
algorithms in fair division, namely "divide and choose" and "adjusted winner" for the case 
of two agents. We quantify the benefit of negotiation in the divide and choose algorithm, 
and its use in tricking the adjusted winner algorithm. Lastly we consider a centralized 
algorithm for maximizing the overall welfare of the agents under the Nash collective utility 
function (CUF). This corresponds to a clustering problem of the type traditionally studied 
in data mining and machine learning. Drawing a conceptual link between this problem 
and the portfolio selection problem in stock markets, we prove an upper bound on the 
increase of the Nash CUF for a clustering refinement. 

1 Introduction 

In many applications a number of parties are interested in possessing a limited resource, e.g. 
a set of goods or metaphorically a cake. Each of the parties has his own valuation of different 
parts of the cake, and each has full, partial or no information about the valuation of the other 
parties. Finding a way to divide a cake fairly has attracted the attention of economists and 
mathematicians for a long time. Before trying to find a fair division, one must define the term 
"fairness". Several criteria of fairness have been introduced to judge the goodness of a division 
where none of which subsumes the others [1]. Here we will give a brief introduction to four of 
them. Assume that k denotes the number of parties. 

• A division is said to be proportional if each party receives at least -r of the entire cake 
w.r.t. his own valuation. 

• A division is said to be equitable if the piece of the cake each party obtains w.r.t. his own 
valuation is exactly equal to what the other parties receive (w.r.t. their own valuation). 

• A division is said to be envy-free if no party believes that, w.r.t. his own valuation, the 
piece another party has received is more valuable than his own. 

• A division is said to be efficient or Pareto optimal if it is not possible to find another 
division that increases the gain of every individual. 
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In the literature of fair division, there are two major assumptions regarding the set of goods 
to be divided: the category of divisible goods where each good or item could be divided among 
parties, and the category of indivisible goods where each item should wholly be given to one 
party (e.g. a car or a laptop) [2]. Analyzing division of divisible goods is generally easier than 
that of indivisible goods. In the most generic scenario some of the items may be divisible, some 
indivisible and some partially divisible. We take care of this generic scenario by considering a 
set T> of "admissible" divisions of the resource. Theoretically the set T> is of size infinity if we 
have a divisible item in the resource (since we can cut that item in any proportion). Practically 
speaking, even divisible items can be cut up to a certain precision. Therefore for simplicity we 
assume that the set V is finite (unless stated otherwise). Lastly, the preferences or valuations 
of parties could be ordinal or cardinal. Here we assume that valuations are cardinal, i.e. could 
be modeled by non-negative real numbers. 

The literature on fair division generally assumes that the division game is played just once, 
and that each party chooses an action that maximizes the value of the minimum piece that the 
action can guarantee. To make this problem amenable to information theoretic analysis, we are 
going to relax the traditional formulation of the problem by introducing notions that parallel 
the familiar concepts of block coding and vanishing probability of error (as compared to zero 
probability of error) in communication theory. We assume that i.i.d. repetitions of the game is 
played multiple times, and that the average gain of a party over the games can be guaranteed 
with high probability. Since we are relaxing the formulation, our bounds serve as upper bounds 
to the traditional one-shot problem. Therefore we are also implicitly addressing the traditional 
problem. 

Any algorithm providing a fair division may satisfy one or some of the fairness conditions in- 
troduced above. From another point of view, fair division may be accomplished in a distributed 
or centralized way. In a distributed algorithm the individuals should divide the cake amongst 
themselves, while in a centralized one, an external referee divides the cake for them. In order to 
address these two categories, we have chosen two prominent algorithms from the field, Divide- 
and- Choose (DC) from the category of distributed algorithms and Adjusted Winner (AW) from 
the category of centralized algorithms. 

The "/ cut, you choose 11 or divide- and- choose (DC) procedure is a well-known and ancient 
algorithm for dividing a resource among two parties [1]. The story of dividing a land between 
Abram and Lot in the Hebrew Bible refers to this method. In this procedure, the first party 
(Alice) cuts the cake into two parts and the second party (Bob) chooses one of the pieces, 
leaving the other piece for the first party. Note that Bob has an advantage over Alice for he 
can choose the best piece and can possibly get even more than half of the total value he assigns 
to the cake. In other words, when Alice does not know anything about Bob's valuation, she 
should divide the cake into two parts which are equal with respect to her valuation, so that 
despite of Bob's choice, she gains at least half of the cake. However Bob achieves more than 
half of the cake since he is free to choose. Since each party can obtain at least half the cake, 
this method is proportional but not equitable [3]. 

The " Adjusted Winner 11 (AW) algorithm was originally proposed by Brams and Taylor [1]. 
Assume that two parties, say Alice and Bob, want to divide a set of m divisible goods. Alice's 
valuation vector is denoted by a vector a = (at, ... , a m ) of m non- negative real numbers that 
add up to one. Similarly, Bob's valuation vector is denoted by b = (b\, . . . ,b m ). We assume 
that the value of a piece of cake for each player is the sum of the portion of each item present in 
that piece times the value that player assigns to that item. In the Adjusted Winner algorithm 
Alice and Bob announce their valuations vectors to an external referee. The referee solves a set 
of equations to come up with a division of the items which is proportional, equitable, envy-free 
and efficient. Brams and Taylor showed that in the case of having two goods, i.e. m = 2, when 
one of the parties, say Alice, knows Bob's valuation while Bob is unaware of this, Alice can 
announce an untrue valuation in order to trick the procedure and gain more than what she is 
expected. See [1, 4, 2] for further reading on fair division. 
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To the best of our knowledge, the problem of fair division has only been analyzed when 
individuals do not know the valuation of others, or when they have complete information about 
the valuations; it is not analyzed in the case of partial information. To motivate this study, let 
us begin with the DC algorithm As we saw previously, the second party, Bob, has advantage in 
choosing the piece he likes more. One way to make the algorithm more fair is to provide Alice 
with partial information about Bob's valuation. For instance if there is an item that Alice likes 
a lot but Bob is indifferent to it - and Alice knows this - she can put all of it in the piece that 
she predicts Bob will not choose. To quantize the role of information in such scenarios, we need 
to find the gain of individuals as a function of the rate of communication between them. This 
leads to characterizing the achievable rate-gain region. The tradeoff between the disadvantage 
of being the cutter and the advantage of having information is most notably present in a seller- 
consumer scenario. A seller offers a good for a price, and the consumer can choose to buy 
the item or keep his money. This problem resembles the DC algorithm and our formulation 
(defined later) is general enough to cover it. Setting a price by the seller resembles cutting a 
cake, and the consumer's choice of buying the item is like picking one of the two pieces "item" 
or "his money". As discussed above this transaction scheme is naturally biased towards the 
chooser, i.e. the consumer. But the seller has generally more information about the consumer's 
needs than the consumer has about the true price of the item. The role of information in the 
bargaining dynamic is also colorful: the consumer hides how much he really needs the item 
while the seller hides how much the item is really worth. 

The last part of this paper considers the role of information in optimizing the social welfare, 
another topic in fair division. In the literature of economics, a social welfare is a function 
that collects the utilities or gains of each individual in the society and returns a real value 
which reflects the overall welfare in the society. Philosophical utilitarianism suggests a division 
strategy that maximizes the overall happiness (or sum of the gains of the individuals). Thus, 
the rules of division here are not decided by selfish players but by an external judge (or by 
players who follow Rawls's veil of ignorance [5]). Another measure for social welfare that cares 
not only about the overall happiness but also about its uniform distribution over the individuals 
(an egalitarian philosophy) is the Nash collective utility function (CUF). Nash CUF is defined 
to be product of the gains of the individuals [6]. Motivated by the role of Nash CUF in fair 
division in large societies, we formulate a problem and study it from an information theoretic 
perspective. 

2 Divide and Choose 

In the divide and choose problem, two players, say Alice and Bob are about to cut a cake. Each 
of them are interested in the different parts of the cake with different valuations, but they are 
not aware of the valuation of the other player. In order for the division to be fair, Alice first 
cuts the cake into two parts arbitrarily. Then Bob chooses one of the pieces, indeed the part 
he likes more, and leaves the other part for Alice. 

We assume that the value each player gives to different pieces of the cake is a random variable 
on the set of possible values V which is assumed to be finite. We have no specific assumption 
over V, but for having an intuition, one can consider the following special case. Imagine the 
cake has m items: chocolate, cream, cherry - • • . In this particular example, a valuation vector 
v is a vector of size m, (v±, . . . , v m ), whose indices are nonnegative real numbers adding up to 
one. The indicies indicate interest in individual items. Thus if a certain piece of the cake has 
portion of item i, the value associated to this piece w.r.t. v is YlT=i a i v i- However it should 
be noted that in the general case, we do not assume that valuations are vectors. 

Just like the way it goes in the real life, it is quite reasonable to assume that the two 
players wish to negotiate and gain information about each other's valuations and then start the 
procedure so that they can achieve a better cut. In fact, without transferring any information, 
Bob has an advantage in the game since he is free to choose the piece that is more valuable for 
him. In other words, when Alice does not know anything about Bob's valuation, she should 
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Figure 1: A schematic of the r round negotiation process. 



divide the cake into two parts that are equal with respect to her valuation, so that despite of 
Bob's choice, she gets at least half of the cake. However in this way, Bob achieves more than 
half of the cake since he is free to choose. Based on this intuition, we are interested to find 
the way the gain of each player and the second player's advantage would change if the parties 
negotiate interactively before cutting the cake. This could be modeled in the following way, 
first Bob gives Alice some information about his valuation, then Alice asks him some questions, 
then Bob answers her questions and perhaps provides some more information. This procedure 
continues for r rounds. Finally Alice cuts the cake based on the information she has gained 
about Bob's valuation. Afterwards Bob chooses one piece and leaves the other for Alice. We 
are interested in analyzing how the gain of each player depends on the amount of information 
they communicate during the r rounds. A schematic of the procedure is given in Figure 1. 



2.1 Definitions 

As mentioned in the introduction we assume that P, the set admissible divisions or admissible 
cuts, is finite. The gain of each player is a deterministic function of the valuations and the 
particular division d G XX This is formalized in the following definition: 

Definition 1. Assume that va and vb are the valuations of Alice and Bob respectively and 
Alice has divided the cake by d 6 V. Then Qa {d,v A \vB) and Qb {d,v A \vB) denote the gain of 
Alice and Bob respectively in one game. 

Consider n i.i.d. repetitions of the game and consider the average gain over these games. 
Valuations of Alice and Bob over the n games are denoted by two sequences of length n, VJ{ for 
Alice and Vg for Bob. These two sequences are independently and identically generated from 
the joint distribution p(va,vb)- The joint distribution p(va,vb) is assumed to be revealed to 
both Alice and Bob. 

Rab denotes the communication rate per game from Alice to Bob during r rounds, i.e. it 
is equal to the total number of bits sent from Alice to Bob divided by n. The rate Rba is 
defined similarly as the overall communication rate from Bob Alice. The formal definition of 
an n-game, r-round negotiation code is in order. 

Definition 2. An r-round, n-game (n, Rab, Rba) code consists of communication variables 
Ci, . . . , C r with encoders p(c p \v B , C[i :p _i]) for odd p and p(c p \v% C[i :/J _i]) for even p as well as a 
division strategy p(d n \v 7 \,C[i :r j) where 

n 

p:even ^ ^ 

- V H(C P ) < Rba- 

p:odd 
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The gains associated with this code are random variables 

1 n 

G a = -Y j Ga (D h V Aji \\V B>i ) , 

(2) 

G b = -Y j Gb (Di,VA,i\\V B J ■ 
n z — ' 

i=i 

In fact the r-round information exchange consists of communication C\ from Bob to Alice, 
C2 from Alice to Bob and so on, therefore odd indices indicate Bob to Alice communication 
and even indices indicate Alice to Bob communication. The division D n over n games is then 
performed by Alice based on all the information she has: communication over all r rounds and 
her own preferences V%- This is visualized in Figure 1. 

Definition 3. For a fix number of interactive negotiation, v, a (Rab, Rba, G a, G b) rate gain 
tuple is said to be achievable if for any 5 > and N, there exists a (n, Rab, Rba) code with 
n > N where the associated gains Ga and Gb satisfy the following inequalities with probability 
at least 1 — 5: 

\Ga — Ga\ < 5 \Gb — Gb\ < 5. (3) 

Definition 4. The rate gain region for r-round communication is the closure of all achievable 
tuples (Rab, Rba,Ga,Gb) and is denoted by IZ(r). 

Remark 1. Our formulation has two differences with the traditional game theoretic setup. 
Firstly the number of games n converges to infinity. Secondly we are not following the maximin 
rule (i.e. maximizing the minimum gain) with probability one. Instead we are demanding a 
guarantee with probability 1 — 5 where 5 converges to zero only after n converges to infinity. 
In a sense the traditional setup corresponds to the "zero- error" capacity in the communication 
literature. Nonetheless our results are still relevant if one is only interested in a finite length 
game. Our formulation is a relaxation of the traditional game theoretic setup; therefore our 
results constitute an upper bound to a finite length game setup. 



2.2 Main Results 

Our main result in this part is to identify the rate gain region for r rounds: 

Theorem 1. IflZ(r) denotes the closure of all rate gain tuples (Rab, Rba,Ga,Gb) such that 

R A B>I(V A ;F [l:r] \V B ), 
RBA>I(V B ;F ll:r] \V A ), 

E[Ga(D,V a \\V b )] = G a , [ ' 

^[gB(D,V A \\V B )] = GB, 

for some (F^^D) G T(r); where T(r) denotes the set of all finite random variables Fh. r ] and 
random variable D on the set of all divisions T> such that, 

V A - V B , - F p p odd, 

V B - V A , F[i-. P -i] - F p p even, (5) 
V B - V A , F [1:r] - D, 

then il(r) = lZ(r). 

This Theorem is proved in two steps: proof of the achievability is given in Appendix A.l 
and the converse in Appendix A. 2. 
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In Appendix A. 3 we show that the region introduced above is computable. It suffices to 
compute the convex hull of the region obtained by restricting the cardinality of Fi to 



i^i<ivini^i. w 



for 1 < % < r. 

To illustrate several aspects of the result, we consider a few examples. Imagine the cake 
has only two items, say cream and chocolate, and the set of possible valuations is V = {O, •} 
where O denotes complete interest in cream and no interest in chocolate, i.e. O = (1,0) while 
• = (0, 1) denotes complete interest in chocolate. Assume the cake is half cream and half 
chocolate and the set of possible divisions is T> = {-3-,-©-} where -3- means dividing the cake 
so that in each piece we have half cream and half chocolate and -©- means dividing the cake so 
that one piece is full cream and one is full chocolate. Assume that the joint distribution over 
valuations, p(va,Vb) is as p(0,0) = p(#,#) = 2/6 and p(0, •) = p(9, 0) = 1/6. 

Since for the general case of r round communication, the region falls in M 4 , we assume r = 1, 
therefore Rab = and the region consists of triples (Rba,Ga,Gb)- For a fixed R, if we limit 
the communication rate to be bounded by R, i.e. R B a < R, the set of achievable gain pairs 
(Ga, Gb) form a region in M? which is illustrated in Figure 2 For different values of R. 

Observe that the gain pairs (Ga,Gb) = (|, \) and (Ga,Gb) = (§, 1) are in the region for 
any value of R since they can be achieved by adopting the fixed division strategies of -3- and ■©■ 
in all of the i.i.d. repetitions of the game respectively. As depicted in Fig. 2, it turns out that 
time sharing between these two strategies is optimal in the extreme case of R — 0. Consider 
the extreme case of Alice having full information about the valuation of Bob, i.e. when R is 
equal to the Slepian-Wolf communication rate R = H(Vb\Va) = 0.92. In this case, Alice can 
use the division -3- in each instance of the game when Bob likes the item that she likes, and the 
division when Bob likes the other item. This gives the point (Ga,Gb) = (§, §)• Although 
not appealing, Alice can choose a strategy that gives rise to the gain pair (Ga,Gb) = (|, §) 
by using -©- when Bob likes the item that she likes, and -3- when Bob likes the other item. As 
depicted in the figure, time sharing between these strategies is optimal. In the case of partial 
information when R is neither nor H(Vb\Va), the region of achievable gains expands as R 
increases, which is expected since we are always allowed not to use the extra information. Note 
that always Gb > 1/2, since Bob chooses the more valuable piece, which is indeed at least half 
the cake. The maximum achievable Ga is 2/3 which could be achieved by full information, i.e. 
when Alice knows Bob's valuation. 

In a practical scenario it is quite reasonable to assume that Alice uses the information 
selfishly in order to maximize her gain. Therefore we can define the selfish gain G s a(R) to be 
maximum gain Alice can obtain limiting the communication rate to a value R, i.e. 

G s f = max G A - 

Rba<R 

In practice, this scenario is more applicable when Alice spies over Bob's valuations with a spying 
rate R (in a single round r = 1). Note that we assume that Bob always chooses the piece he 
likes more with no concern about Alice's gain. Let G s g(R) denote the gain associated with Bob 
in this case. Since we want to study the equitability of division (a fairness criterion discussed 
at the beginning of the introduction), we define the difference between these two gains as 

A scl (i?) = Gf{R) - Gf(R). (7) 

A spying rate R results in an equitable division if A sel (i?) = 0. 

Figures 3, 4 and 5 respectively show the values of G s a, G s g l and A sel as functions of R for 
our example. As we see, Alice's spying gain always increases with the rate, which is expected, 
since she can use ignore the extra spying information. However, the interesting observation is 
that Bob's gain increases up to some value for small rates and then decreases. This means that 
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Figure 2: The gain region for constant rate. 



G Amax for (2/6,1/6,1/6,2/6) 
0.68 1 1 1 1 1 1 1 




Figure 3: The maximum achievable gain for Alice when she acts selfishly, as a function of 
the rate of communication R. 
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G for (2/6,1/6,1/6,2/6) 
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Figure 4: The gain associated with Bob when Alice acts selfishly, G s § 1 , as a function of the rate 
of communication R. 



A for (2/6,1/6,1/6,2/6) 
max 

0.1 4 | 1 1 1 1 1 r— 




Figure 5: Difference between two selfish gains, A sel (i?) = G s g(R) — G s £ l (R), as a function of 
the rate of communication R. 
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Figure 6: G B as a function of R for the probability distribution (8). 

up to a point, sharing information is advantageous for both sides. The other point is that the 
value of A sel is zero only when R = and R > H(Vb\Va), this suggests that the division is 
equitable just in case of zero information or full information. The reason for this is that the 
divisions are so that Bob's gain is always greater or equal than that of Alice. In other words for 
any va,v b G V and d ET>, Q b (d, va\\vb) > Qa (d, va\\vb), therefore we always have Gb > Ga- 
However, other behaviors can be observed when changing V, T> and the joint probability. 
For instance, by keeping V and T> unchanged, but changing the joint probability distribution 
as 



p(o,o) 



1 6 

14' = 

5 2 

— , P {;%) = —. 



(8) 



14 
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We observe that Bob's gain, G S B (R) initially decreases and then increases slightly, as depicted 
in Figure 6. In this example, unlike the latter one, it is more probable that the two players 
have different valuations, therefore in the case of zero information, it is more beneficiary for 
Alice to divide the cake by ■©■ which results in a gain of 1 for Bob. The rate gain region for 
this example is illustrated in Figure 7. 

As was discussed before, Bob's gain will be always greater than or equal to Alice's for the 
choice of V = {-©-,-3-}. Now, we change the setup to, 



V = {0,€>}, 

X? = {^,-©}, 

p(0,0) = - p(0,€>) 
o 

p(f>,0) = - p(€>,€>) 

o 



2 

6' 
1 

6' 



(9) 



where €) means 2/3 interest in chocolate and 1/3 interest in cream, O means 1/3 interest in 
chocolate and 2/3 interest in cream, -9 means dividing in a way so that in one piece we have 
all chocolate and 3/16 of the whole cream and denotes dividing in a way so that in one piece 
we have all the cream and 3/16 chocolate. In this case, Figures 8, 9 and 10 show G s ^ 1 , G s B l 
and A sel respectively as a functions of R. As we see for a rate R eq , A sel (R eq ) = which shows 
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Figure 7: The rate gain region for the probability distribution (8). 

that with the information rate of R eq , the division is equitable, while for information rate less 
than that amount, Bob has advantage and with more information rate, Alice has advantage. 
In fact, this value of information makes an equilibrium between the natural advantage of Bob 
over Alice and the information Alice gains about Bob's valuations. Figure 11 shows that the 
rate region for this example is a part of a line in the plane for all values of R. 

Another interesting fact could be observed by changing the probability distribution of (9) 
into 

V = {0,€>}, 

V = {-V,-Q.}, 

P (o,o) = i p(o,«>) = ^, (10) 
p(«,o) = A ?(•>,•) = 

As we can see in Figure 12, Bob's gain first increases, then decreases and then increases again. 
The region of this setup is depicted in 13. This together with our latter observations suggest 
that Bob's gain does not have an specific behavior in general. 



3 Adjusted Winner 

Assume two parties, say Alice and Bob, are about to divide a set of m goods. Unlike the Divide 
and Choose method, they announce their valuations over these goods which are nonnegative 
vectors of sum 1 and size m, a = (a±, . . . , a m ) for Alice and b = (pi, ... , b m ) for Bob to a third 
party whose duty is to divide these items fairly based on these announced valuations. The 
Adjusted Winner is an algorithm that solves a sequence of equations in order to give a division 
of the items which is equitable, envy free and efficient [1]. We note that the divide and choose 
method does not have these properties. 

Brams and Taylor showed that in the case of having two items, a dishonest party who has 
full information about the other party's valuation vector, while the other party is unaware of 
this and acts honestly, can trick the referee [1]. We extend this to the case of partial information. 
We assume that Bob announces his valuation honestly while Alice uses the partial information 
he has gained by spying over Bob's valuation to trick the referee and announce an untrue 
valuation instead of her true valuation. 
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Figure 8: Gf(R) for the setup of (9) 



G Bmax for (1/6,2/6,2/6,1/6) 
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Figure 9: Gf(R) for the setup of (9) 
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A for (1/6,2/6,2/6,1/6) 
max v ' ' 

0.04 1 1 1 1 1 1 r— 
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Figure 10: A se \R) for the setup of (9) 
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Figure 11: The rate gain region for the setup of (9) 
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Figure 12: Gf(R) for the setup of (10) 
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We find the trade off between the " spying rate" and Alice's " spying gain" . We analyze this 
tradeoff in two possible cases: the first case is when the set of valuations is finite and Alice 
can spy any arbitrary function of Bob's valuation vector consistent with her spying rate. The 
assumption that the set of valuations is finite is a practical assumption since we can assume 
that the value assigned to an item by each individual is a real number with finite precision. 
Therefore the set of all valuation vectors is finite. We find the tradeoff in this case via a simple 
transformation from AW to DC. 

In the second case, we assume that the number of items, m is equal to 2 and Alice's valuation 
is fixed, while Bob's valuation of the first item is uniformly distributed in an interval [6 m i n , &max] 
(since there are two items and AW assumes that valuation vectors are of sum one, each valuation 
vector is of the form (x, 1—x) where x G (0, 1), therefore could be expressed by one real number, 
which is x in this case). Most importantly in this case we consider a particular (but practical) 
set of binary searching questions of the form "Is Bob's valuation on the first item less than a 
particular value a or more than that?" By asking such questions, at each step we divide the 
interval into two subintervals. We assume that Alice can ask R questions on average in each 
game, or totally nR questions. We derive upper bounds for the improvement of Alice's spying 
gain as a function of spying rate R. 

3.1 The Model 

The adjusted winner algorithm divides the items as follows. Reorder the items so that, 

^>^>...>^>1>^±1>...>^ (11) 
bi b 2 b t b t+1 b m 

give items 1 through t to Alice and items t + 1 through m to Bob. If their gains at this step is 
equal, the job is finished. First assume Alice's gain is more. In this case, give a portion of item 
t so that their gain becomes equal. If even by giving all of item t this did not happen, go for 
item t — 1 and and continue this procedure until the equality holds. For the second case when 
Bob's gain is more, in a similar way, give a portion of item t + 1 to Alice to achieve equality. If 
this was not sufficient, go to item t + 2 and continue. Since eventually by giving all the items to 
the party with less gain, his gain becomes more, at some point in between their gains become 
equal and the procedure terminates. 

One of our results on adjusted winner considers the case where there are two items, i.e. 
m — 2. In this case the valuations are (a, 1 — a) for Alice and (b, 1—6) for Bob. Therefore 
we can simply take a and b as valuation numbers or more simply valuations. If Alice has some 
information about Bob's valuation, she may be able to announce her valuation untruly as a so 
as to gain more than she would if she announced her true valuation a. We assume that Bob 
announces his valuation honestly. A practical scenario of this model could be when Alice gains 
information about Bob by spying over his valuation. 

3.2 Adjusted Winner Algorithm for two goods 

Using the procedure of Adjusted Winner as was explained above, we will find the exact form 
of the division given by the algorithm in the case of m = 2, as a function of algorithm's 
inputs, a and b respectively as valuations of Alice and Bob. In this case, AW a (a, b) is a 
vector of length 2, say (d^d^), indicating the portion of goods given to Alice. Similarly 
AWb (a,b) = (dfjd^) denotes the portion of goods given to Bob. Since we divide the goods 
between parties, df + df — 1, d — 1, 2. In the following, since we are interested in Alice's gain, 
we use AW (a, b) for AW^ (a, b) unless otherwise stated. 

Using the Adjusted Winner algorithm discussed before, we can derive the following formu- 
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Figure 14: An example of \I/ as a function of a for 6 = 0.3 and a = 0.1. Note the discontinuity 
of the function at a = b, its convexity in the intervals (0, b) and (1 — b, 1) and its concavity in 
the interval (b, 1 — 6). 



lation for AW (a, b), as derived in Appendix B.l: 



AW (a, 6) 



< a < min(l - 6,6), 
max(l — 6, 6) < a < 1, 
6 < a < 1 -6A6 < 1/2, 
.(1-^,1) l-6<a<6A6> 1/2. 



(12) 



3.3 Definitions 

Definition 5. Alice's gain when she announces valuation (5,1 — a) lo/w/e aer irue valuation 
is (a, 1 — a) m tae case that Bob's true valuation is (6, 1 — 6) which is equal to his announced 
valuation is denoted by \1/ (a, a\\b) which is equal to, 

V(a,a\\b) = AW(a,b)-(a,l-a). (13) 

Using (12) we can write the exact expression of this function as we will see later. An example 
of \I/ is presented in Figure 14. 

In the case where 6 is uniformly distributed in [6 min , 6 max ], the gain associated with a in an 
integral with respect to 6, which is discussed in the following definition. 

Definition 6. Alice's expected gain when she announces valuation (5,1 — 5) while her true 
valuation is (a, I — a) in the case that Bob's true valuation is uniformly distributed in [6 m j n , b max ] 



and he acts honestly is denoted by ^ (a,a\\b min ,b ma ,, i 



\l/ (a, cj||6 m j n , b max ) 



h — h ■ , 

u max u mvn J b 



and is equal to, 

Umax 

* (a, a|| 6) d6. 



(14) 



Note that a and 6 are the two inputs to the Adjusted Winner algorithm. When we integrate 
over 6, at one point in the integration 6 = a. As is discussed in Appendix B.l in the case where 
the two inputs to the Adjusted Winner algorithm are identical, there are two possible divisions 
of the cake as the output of the algorithm. If both players had announced their valuations truly, 
these two divisions would give them the same gains; however, in our scenario, Alice announces 
an untrue valuation. Thus, when a = 6, these two valuations result in two different gains for 
Alice and the function under integration is not defined in this one point. However, since the 
integral is not dependent on the value of one point, we can omit it. 

Definition 7. The maximum expected value of Alice 's gain with above conditions is defined as, 



^* (a\\b min ,b max ) = max \l> (a,a||6 m ,„, 6 



.. mini '-'max) 

0<a<l 

max * (a,a\\b min ,b max ) . 

'■'■mill. 

<a<b m 



(15) 
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The second line in (15) suggests that the optimum value of a for \l/ falls in the interval 
[b m m , & max ] which is justified in Corollary 3. 

Definition 8. For a fixed value of Alice's valuation, a, and Bob's valuation b uniform in 
[b min , b max \ and a series of dividing points for k questions b min = b < ■ ■ ■ < b 2 k = b max , the 
improvement of Alice's gain by asking this set of questions is denoted by 

b b' 

A k (b , ...,b 2 k) = Y2 (a||6i_i, h) - ^* {a\\b min , b max ) . (16) 

— Ook — On 

8=1 1 U 

The maximum improvement by asking k questions is 

A* k (b min , bmax) = max A fc (b , . . . , b 2 k) . (17) 

bmin — bo<---<b 2 k —b max 

Note that the term f—z^ i R (16) is the probability of the event b G [&i_i,£>i]; in fact, 
Afc (b , . . . , b 2 k) is the expected value of Alice's improvement in gain having the fact that b is 
uniformly distributed in [6 m i n , b m3iX }. 

Note that A* is defined on intervals. Later, when we want to prove upper bounds on gain 
improvement, it will be convenient to work with a special set of these functions, which we name 
interval concave. Note that this terminology is not related to the concept of concavity and is 
used simply because the condition has similarities to what we have for concave functions. 

Definition 9. If Ai is the set of all pairs (x, y) G R 2 such that b m %n < x < y < b max , a function 
A : Ai — > R is said to be interval concave in [b m in, b m ax] if for all (x, y) G Ai and x < t < y we 
have, 

— A(x, t) + ^A(t, y) < A(x, y). (18) 
y — x y — x 

3.4 Main Results 

Our results on adjusted winner can be divided into two parts. In subsection 3.4.1 we assume 
that Alice can spy any arbitrary function of Bob's valuation subject to a rate constraint. In 
subsection 3.4.2 we assume that m = 2 and that Bob's valuation of the first item is uniformly 
distributed in an interval [6 m i n ,6 ma x]- For the sake of simplicity we assume that 1/2 < 5 min , 
i.e. the entire interval falls in the right half. Note that this is reasonable, since in practice it 
means that Alice knows which of the two items Bob likes more, but she does not know his exact 
valuation. Also note that the case which 6 max < 1/2 (the entire interval falls in the left half) 
could be reduced to this case by changing the order of items. 

We assume that Alice can gain information by dividing [b min , & max ] into sub interval by asking 
binary questions, i.e. if she wants to ask k questions she divides the interval into 2 fc subintervals, 

b mm = b < bi < ■ ■ ■ < b 2 k = 6 max , (19) 

therefore by asking the first question she finds out whether b falls in subinterval [6 m i n , b 2 k-i] or 
(b 2 k-i, 6 max ]. If the answer is the left subinterval she divides it by asking b 2 k-2, otherwise she 
divides the right subinterval by b 3x2 k-2 and so on. Note that this kind of questioning guarantees 
that at each step the distribution over b remains uniform. Alice's goal is to find the optimal 
dividing questions, b\, . . . ,b 2 k_i and an announced (untrue) valuation di, 1 < i < 2 k for each 
subinterval so that her average gain (over the randomness of Bob's valuation which is assumed 
to be uniform) is the maximum possible. We are interested in analyzing the role of number of 
questions Alice can ask on the improvement in her gain. 

Motivated by this, we will address the problem of distributing a number of questions among 
n games. For this, assume that during n games, Bob's valuation is i.i.d. random variables 
uniformly distributed in [6 m i n , b max ] and a is fixed in all games. More precisely, if Alice is allowed 
to ask R questions on average in each game, or totally nR questions, we are interested in finding 
bounds for the Alice's expected improvement in gain averaged over n games considering R as 
a factor indicating the amount of information. 
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3.4.1 Finite Valuations 

If we assume that the valuations are limited to be finite, our result for divide and choose 
could also be applied to the problem where the role of Alice's information on her announced 
valuation is under investigation. In fact, we can completely solve the problem and characterize 
the "spying" gain rate region. In this case Alice's announced valuation, a, plays the role of the 
division D in divide and choose and the following gain functions could be defined, 

£ A (a,a||b) = AW A (a,b) ■ a, 
Qb (a,a||b) = AW b (a,b) ■ b. 

Note that although the two problems have conceptual differences, by using this transformation, 
we can consider this problem a special case of divide and choose. Also note that in this 
approach, even the assumption of m = 2 is not necessary. However, we will also address the 
more complicated problem when valuations are not limited to have finite values and Alice is 
allowed to ask dividing questions, as was discussed. 



(20) 



3.4.2 General Uniform Case 

In this section we assume that Bob's valuation is uniformly distributed in an interval [b min , 6 max ]- 
Note that since the set of possible valuations is infinite and we can not spy for arbitrary functions 
of valuation vectors over the n games, in this case, we can not solve by out result in divide and 
choose. Also for the sake of simplicity we assume the maximums in Definitions 7 and 8 exist. 
One can check that if we replace maximums by superimum and taking suboptimal points, the 
same results hold. 

Our main results in this section are Theorems 2 and 3 as follows. The proofs are given in 
Appendix B.2. 

Theorem 2. Assume a is fixed, b is uniformly distributed in [b m i n , b max ] and we have an interval 
concave A in [b m i n , b max ] which is an upper bound for A* in this interval, i. e. 

Vx, y b mm <x<y< b max A{ (x, y) < A(x, y), (21) 

then for all k > 1 we have 

^■k (Pmiri) b max ) < k/\{b m i ni b max ) . (22) 

We can see how this gives an upper bound on the average improvement in gain in n games 
when Alice can ask R questions on average in each game. 

Corollary 1. Assume a is fixed and Bob's valuation in n games are i.i.d. and uniformly dis- 
tributed in [b m i n ,b max ]. If Alice can ask R questions on average in each game, or totally nR 
questions, and A is the bound of Theorem 2, then the average improvement on Alice's expected 
gain which is averaged over n games is bounded by RA(b min , b max ) . 

This is an immediate result of Theorem 2, since if Alice asks ki questions in the zth game, 
her maximum improvement is, 

1 n i n 

-E A ^-5> A = i?A > (23) 
i=i i=i 

where we have dropped b min , 6 max arguments since they are constant in n games. 

In Theorem 2 we have assumed the existence of an upper bound. The following theorem 
gives an upper bound in a special case. 

Theorem 3. Assume that for b min < x < y < b max , A^ (x,y) is differentiable with respect to y. 
Then A(x,y) defined by 

d 

A(x,y) = max — 1X71,72), (24) 
x<7i<72<y oy 
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where 



T( X ,y) = { iy - x)Al ^ y) V>X > (25) 

o y = x, 



is an interval concave upper bound for A\ . 
3.4.3 Special Uniform Case 

From now on, we will limit ourselves to the case of 1/2 < b min < 6 max < 1. As discussed before, 
the case where < 6 min < b mSLX < 1/2 reduces to this case by changing the order of items. In 
fact, we will only consider the item which is more valuable for Bob. Our main result in this 
section is the following Theorem which is used to completely characterizes the improvement in 
gain in Corollary 2. The proofs of all statements in this section are given in Appendix B.3. 

Theorem 4. For a fixed 1/2 < b m i n < b max < 1, if a is outside the interval (rj, t u ) where 



2b 2 -\-2b b 

/ 7 i \ _ ^ u max ' ^ u maxU 

'u\"mini "max) L , ol ; 



{bmini b r . 



^max 3b mm i - )f 

2b b ■ +2b 2 ■ 1 ' 

^ u max u min ~ ^ u mm 



'mini u maxJ , , j 

'max \ Omin 



3b max ~\~ b r 

then the sequence {A* k (b min , & m ax)}fcL w ^ ere ^ s defined to be 0, is concave. 

Throughout this section, we will assume that a is outside the interval, i.e. a < t\ or a > r u . 
Before getting to prove this, first we will see the important consequence which is an immediate 
result of the concavity of the sequence of improvement in gain, 

Corollary 2. If 1/2 < b m i n < b max < 1, and a is outside the interval (ti,t u ), then the strategy 
of spying either \_R\ or \R\ questions in each game (with the average number of questions no 
larger than R) maximizes the spying gain of Alice. 

Note that this is an immediate result of Theorem 4. Assuming R is integer, if we assume 
that Alice asks ti questions in game i, then her average improvement in gain will be at most, 

1 n 

/ ipvaim ^max) — (b mm , &max) — (^min; ^max) j (^7) 

1=1 

where we have used the concavity of the sequence. Therefore the strategy of asking exactly 
R questions in each game maximizes Alice's gain. In the case where R is not an integer, 
the strategy of spying either |_-RJ or \R~\ questions in each game (with the average number of 
questions no larger than R) maximizes the spying gain of Alice. 



4 Maximum Nash Collective utility function 

In this section we consider an arbitrary society with a government who wants to divide its 
several resources among the citizens. Each person assigns a value for each of the resources 
available to the government, and we assume that the government knows these valuations. The 
Nash collective utility function (Nash CUF) for a given division strategy is equal to the product 
of the gains of individual members of the society of that division strategy. Maximizing the Nash 
CUF for this society implies a division policy for the government, specifying how much of each 
resource should be allocated to each individual. For practical reasons the government may want 
to divide the citizens into several clusters, say drivers, teachers, etc, and apply the same division 
strategy uniformly to all people from the same class. We consider the increase of Nash CUF 
for a clustering refinement and draw conceptual links between this problem and the portfolio 
selection problem in stock markets [7]. 
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4.1 The model 



Assume that the population of the society is n, which is fixed. The valuation vectors of all the 
individuals in the society is known to the government. We assume that the government has 
partitioned the society into k clusters V = (Vi, . . . ,Vk). Let denote the number of people 
in cluster Vi and a, = — . The government has decided to use a fixed division strategy for all 
people in cluster % which is denoted by b«. The sum of the portion each individual receives 
should sum up to one, i.e. ^j =1 Wib, = 1 = X^=i( nQ! i)ki = 1- Let us denote the the valuation 
vector of people in cluster i by v^, v i2 , . . . , v in .. 

Based on the valuation vectors of individuals, the government wants to divide the items so 
as to maximize the Nash CUD of the society, which is 

Wp = maxTTTTb*Vy, (28) 

bl:fc 

In the second scenario, the government divides one of the classes, say the first class, into 
two subclasses la and 16 and uses different division protocols for these subclasses. If V denotes 
the new partitioning and Wp> to be the maximum Nash CUF in the new scenario, 

k m 

w v ,= ; max / n^n^nn^' ( 29 ) 

la ' 16 ' 2:fc v la vu i=2j=l 

By taking b' la = b' lb = hi and h[ = bj for i > 1, we realize that W > W. In fact by refining 
the classification, the government can improve the social welfare, which was expected. In this 
section, we are interested in finding an upper bound on the possible improvement after this 
refinement. 

Define Vj to be the random variable whose distribution is the empirical distribution of the 
valuation vector of people in class i, i.e. for any set A 

P(Vl eA)= \MlIii±A j ( 30) 

also define r.v.'s Vi and V~u to be the random variables for empirical distribution of subclasses 
la and 16. Values of a.\ a and au are defined in a natural way by dividing the size of classes la 
and lb to n. Note that 

p(Vi = vi) = — P (y la = v a ) + — p(Vu = Vl ). 

We can define a random variable E indicating where a randomly chosen person from class 
1 belongs to la, or to 16. In this case p(E = 0) = o.\ a ja\ and p{E = 1) = au/ai. Also 
p(Vi = Vi\E = 0) = p(Vi a = vi) and p(Vi = v\\E = 1) = p{Vib — Vi), which is simply the 
Bayes rule. We denote the support of Vi by the set Vi (i.e. p(Vi = v±) > <^=>- Vi G Vi). 
Similarly we let V\ a and Vu to be the support of Vi a and Vi&. Note that V\ a C V\ and Vu C V\. 

In a more generalized but similar case, we can assume that instead of dividing cluster V\ 
into 2 clusters, we divide it into t clusters 7\i, . . . ,Vi t t and show the new partitioning by V. 
Exactly in the same way, we define random variables E and VV 

4.2 Main Results 

Our main result in this section is the following Theorem which is proved in Appendix C. 

Theorem 5. With the above notations, if we refine the clustering V by dividing cluster V\ into 
t clusters resulting in a new clustering V , we have, 

W v < Wt» < W v 2 niI{Vl ' E \ (31) 
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Remark 2. Since V is a refined version of V , the lower bound on W-p> is expected. To intu- 
itively understand the upper bound, note that a good clustering ofV\ puts valuation vectors that 
are geometrically close to each other into the same cluster. Therefore knowing that a person 
is in a certain cluster V\ 7 e for some E should provide some information about the geometrical 
location of the valuation vector of the person. Thus I(V\,E) is large for a good clustering. 
However a large I(Vi;E) does not necessarily imply a good clustering. Such information the- 
oretic interpretation of clustering (traditionally a topic of data mining and machine learning) 
may be new (we have not seen it) and it may be of independent interest. 
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A Proofs for Divide and Choose 

A.l Proof of the achievability of the rate gain region 

In this section we prove that TZ(r) C lZ(r). Note that since lZ(r) is closed by definition, it 
suffices to prove the following, 

Theorem 6. If there exist finite random variables Fi,...,F r and division strategy D with 
values in V where (F[ 1:r ],D) e T(r) and the rates R AB and R BA are chosen so that 

Rab > I(V A ;F [1:r] \V B ), 
R BA > I(V B ;F [1:r] \V A ), 

E[g A (D,V A \\V B )] = G A , [ ' 

E[g B (D,V A \\V B )] = G B , 

then the rate gain tuple {G A ,G B , R AB , R BA ) is achievable for the r-round negotiation problem. 

For proving this, we will use the existing results on the Empirical Coordination version of 
the channel simulation problem which is summarized as follows. Assume two terminals have 
samples of random variables X\ and X 2 with joint pmf p{x\,X2). The goal is to simulate the 
channel p(yi, 2/2 1 ^1? ^2) and generate Y x and Y 2 in terminals 1 and 2 respectively. Since the first 
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terminal has only access to X\ while Y\ is dependent on both X\ and X2, which is the same 
story for the second terminal, the two terminals need to communicate with some rate in order to 
gain information about the other terminal so that they can simulate the channel. This process 
could be done during r rounds in n consecutive i.i.d. samples, i.e. p(x™, x 2 ) = YYi=i v( x i,i, x 2,i)- 
In the coordination version of the problem, the two terminals are supposed to generate jointly 
typical sequences of and Kf with X™, Xg with high probability. Yassaee et al. have derived 
the rate region for this problem in [8]. 

Proof. Substituting X\ by V A , X 2 by V B , Y\ by D and Y 2 by a constant, say 0, the empirical 
coordination rate region in [8] guarantees that if the following conditions are satisfied, then for 
a given 8 > and N, there exists a (n, Rab, Rba) code with n > N such that (V A , Vg, D n ) are 
8 typical with probability at least 1 — 8. 

V B - V A , - F p p odd, 

V A -V B , F[ ltp -i] - F p p even, 
V B ,Y 2 -V A ,F 1:r -D, 

V A ,D-V B ,F [1 „ ] -Y 2 , 1 ' 

Rab > I(V A ;F [1:r] \V B ), 

Rba>I{V b -F {1 . t] \V a ). 

Note that the above conditions are either among our assumptions or result from the fact that 
Y 2 is constant, therefore are all satisfied. Using properties of typical sequences, 

1 n 

\G A -G a \ = \~Y^Ga (A, V A>i \\V B>i ) - E [Q A (D, V A \\V B )] | < 5G A < 5, (34) 
i=i 

since gains are bounded by 1. The same inequality holds for Gb- This proves the achievability. 

□ 



A. 2 Proof of the converse of the rate gain region 

In the following Theorem, we prove that TZ(r) C TZ{r). The converse is similar to the one given 
in [8], although the two problems are not identical. Therefore, we omit the proof of the common 
part and refer the reader to [8] for more details. Note that since the statement in the following 
Theorem is true for all 5 > and also inequalities could be substituted by strict inequalities 
by subtracting 6 in the right hand side, (Rab, Rba, G A , Gb) falls in the closure of 7Z(r) which 
is equal to 7Z(r) by definition. 

Theorem 7. If a rate gain tuple (Rab, Rba,G a ,Gb) belongs to the rate gain region with r 
rounds of communication, then for each 5 > there exist (Fi :r , D) G T(r) such that 

Rab > F(V A ; F[x :r ]\V B ) , 

R B A>I(V B ;F [l:r] \V A ), 
\E[g A (D,V A \\V B )]-G A \<5, [ ' 

\E[Q B (D,V A \\V B )]-G B \<8. 

The converse has much in common with the proof of the converse in [8] by setting X\ = V A , 
X 2 = Vb, D = Y\ and Y 2 = in their terminology. Therefore we omit the common parts. 

Proof. Since the tuple (R A b, Rba,G a ,Gb) is achievable, for 8 > and N, there exists a 
(n, R A b, Rba) code with n > N with communication variables Cu. r ] and division D n such that 
its average gains G A and Gb satisfy 



n 

/° vcn (36) 
- V H(C P ) < R BA . 

p odd 
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Also with probability at least 1 — 5 \G A — G A \ < 5 and \Gb — Gb\ < 8. 

Now define the auxiliary random variables Fi, 1 < i < r and D. Take Q to be a random 
variable independent from all other random variables and uniformly distributed in [1 : n] and 

F i = C i V A[Q+1 . n] V B[1:Q _ 1] Q, 
D = D Ql 

note that since Q is uniform and independent from all other random variables and V A and V B 
are i.i.d. therefore V Aq = Va and Vb q = Vb- We claim that (F[i :r ],D) G T(r), for proving this, 
we need to show the following Markov chains: 

Va ~ V B , ^[ni-i] - Fi i odd, (38a) 
V B - V A , i^i^-i] - F t i even, (38b) 
Vb-V a ,F [x . a -D. (38c) 

The proofs for (38a) and (38b) are identical to that of [8]. Note that the role of D is Markov 
chains is as if we had a r + 1 th random variable F r+ \ = D n , therefore like (38a) and (38b) we 
can show that 

V B - V A ,F [1:i] - D n , V A[Q+1:n T/ B[UQ _ n Q, 

therefore 

V B -V A ,F [1:i] -D Q . 

which is what we wanted to prove. 

Just like the converse in [8], we can show that 

Rab > F(Va; F[x :r ]\V~B), 
Rba>I{V b ;F {1 . t] \V a ). 

In order to complete the proof note that, 

|E \Q A (D, Va\\V b )} - G A \ = \E [G A (D q , Va,q\\V b , q )} - G A \ 



(39) 



1 n 

-J2 E &a (D q ,V A jV B , q )]-GA 



n 

q=l 



(40) 



\G A -G A \<8. 



Following a similar procedure \Gb~ G b \ < 5. Therefore F\i :r ] and D satisfy all our requirements, 
which completes the proof. □ 



A. 3 Proof of Cardinality Bounds for Theorem 1 

Before proving the cardinality bounds, we need to prove the following: 
Lemma 1. The rate gain region 7Z(r) defined in Theorem 1 is convex. 

Proof. Since the closure of a convex set is convex, it suffices to take two tuples (G\, G B , R AB , R 1 ba) 
and (G A , G 2 B , R AB , R 2 B a) i n ^( r ) which are produced by Di, F^, and D 2 , F£. r , respectively and 
< A < 1, and show that 

{G\, G b , R\b, Rba) = ^(^a, ^b, R\b, Rba) + MG A , G b , R 2 ABl R\ a ), 

is in lZ(r) where A = 1 — A. Define a binary variable Q independent from all other random 
variables which is equal to 1 with probability A and 2 with probability A. Define new variables 
Fi = (i^ Q , Q) and D = D Q . We have, 

I(V A ;F [1:r] \V B )^I(V A ;Fg. r] ,Q\V B ) 

= /(Va; Q\V b ) + I(V A ; F® r] \V B , Q) 

= + XI(V A ; F[ l:r] \ V B , Q = 1) + XI{V A ; F*.. r] \V B , Q = 2) 

= \I(V A ; F^ r] \V B ) + XI(V A ; F? 1:r] \V B ), 
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using this we have, 



> XI(V A ; F{ Vr] \V B ) + XI(V A ; F? 1:r] \V B ) 
= I(V A ;F [1:r] \V B ), 

using exactly the same method we have R BA > I{Yb\ F\\ :r \\V B ). Also, 



(42) 



E 



Q A [D,V A W B 



E 



E 



Qa[D,V a \\V b 



Q 



AE \Q A (D u V A \\V B )} + AE [Q A (D 2 , V A \\V B )\ 
XG\ + AG^ 



(43) 



G A 



A- 



substituting A by B we have G B = E \p B \ D, V A \\V B J^ . It remains to prove that (F[i :r ],D) G 
T(r). For odd % we have, 

I(F; Va| Vb, = J(F 4 Q , Q; V^Vb, F^,, Q) 

= I{F?-V A \V B ,F$._ XV Q) 



J2 I(F?\ V A \V B , Q = g)P [Q = g] 



(44) 



3=1 



= XliF 1 ; V A \V B , Ff 1:i _ 1} ) + AI(ff ; V A \V B , F^_ 1} ) 
= + = 0, 

where we have used the fact that Q is independent from other random variables, therefore 
V A — V B , — Fj. Using the same way we realize that V B — V A , — Fj. Now we show 

that Vb - Va, F[i :r] - D: 

I(D; V B \V A , F [1:r] ) = I(D Q , Q; V B \V A , F® r] ,Q) 

2 

= HDq; Vb\V a , Fg p] , Q = q)P [Q = q] 



q=l 



(45) 



AJ(D i; Vb|Va, if lp] ) + AI(L> 2 ; VB|y A , if 1: 
+ = 0, 



therefore (F[i :r ],.D) G T(r) and the proof is complete. 



□ 



Proof of the cardinality bounds. Let C(r) be defined as TZ{r) with the cardinality bounds im- 
posed, and convex hull taken. Since there is the cardinality constraint for C(r), we have 
C(r) C TZ(r), therefore it suffices to prove that lZ(r) C C(r). From Lemma 1 we have 7t(r) is 
convex, it suffices to show that for all Ai, A2, A3, A4 G R 

sup AiG^ + A2GB + \3RjiB + X^R BA < sup AiGa + A2GB + X%R AB + A4FBA, 

£(r) C(r) 

for this, we show that if we take (F[ 1:r j, D) G T(r) we can reduce the cardinality of |Fj| so that 
the value of g(F^. r ], D) defined as 



g(F [1:r] , D) = AiE [0 A (-D, Vk||Vb)] + A 2 E [Q B (D, V A \\V B )\ 
+ X 3 I(V A ;F [1 ._ r] \V B ) + X 4 I(V B ;F [1 ,. r] \V A ), 



(46) 



does not decrease. We show that the cardinality of J-j could be reduces to the desired value by 
induction on %. We use cardinality bounding methods as introduced in [9]. At step i we change 
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the joint probability Pi-i{vA, Vb, d, /[i :r ]) into Pi(va, Vb, d, /[i :r ]) so that the cardinality of J 7 , 
reduces to the desired value and the marginal distribution of pi(v a, vb, fii-.i-i]) does not change, 
therefore the cardinality of Jrui-ii remains unchanged. If indicates the expression defined in 
(46) in step i we show that during this process > gi-\. Also note that the distribution at 
step 0, i.e. p (v A ,v B ,d, f[i- r ]), denotes the initial probability distribution. 
We define the probability distribution at step z as 

Pi(v A ,v B ,dJ [1:r] ,d) = Pi(fi)Pi-i(v A ,v B ,dJ r \ l \fi), (47) 

in fact, we only change the sequence {Pi(/i)}/ie^i an d l eave the conditional distribution of other 
random variables unchanged. Define At to be set of sequences {Pi(fi)} foe^ such that for the 
induced distribution pi, we have 

Pi(v A ,V B , = Pi-l(VA,V B , /[l:t-l]) Vt>A, V B G V, G Tj % j < i, (48a) 

Va - Vb, - F P P odd, (48b) 

V B - V A , i^p-i] - P even, (48c) 

Vb — Va, F[i :r ] — D, (48d) 

^ P*(/0 = 1, (48e) 

Pi(/<)>0 V/iGJi, (48f) 

note that {Pi-i(/i)}/ i e^ j G *4j, therefore .A, is not empty, also conditions (48c) and (48f) 
quarantine that pi is a probability distribution. Now we simplify the conditions in (48). First 
assume that i is odd, we claim that the reduced set of constraints 

Pi(VB, /[l:t-l]) = Pi~x(VB, G V, /j G J}, j < Z, (49a) 

P*(/<)>0 V/iGJi, (49b) 

note that this set of conditions is necessary for (48), to show that they are sufficient, assume 
that the conditions in (49) hold, first we begin by showing (48e) to make sure that pi is a 
probability distribution. Using (47), (49a) and the fact that pi_ x is a probability distribution 
we have, 

J^Piifi) = ^Piifi) Y Pi-i( v B,f[i-i]\fi) 

fi fi v B,f[l:i-l] 

= Y 5^Pi(/0Pi-l( u B ; /[i-l]l/i) 

«Bl/[l:i-l] fi (50) 

= Y 5^Pi-l( U B ; /[i-l] ; /i) 

»B,/[l:i-l] fi 
= 1, 

then we show that (48a) is true. For this, take an arbitrary v A G V, 

Pi(vA,VB,f[l;i-l]) = 5^Pi(/i)Pi-l(fA,'y J B ) /[l:i-l]|/i) 

fi 

= 5^Pi(/i)Pi-l(^B, f[l:i-l]\fi)Pi-l(v A \v B , fi) 



fi 

(a) 



A (51) 

Pi-lM^B, 53Pt(/i)P<-l(Ufl, /[l:t-l]|/<) 

/< 

Pi-l(v A \v B , f[l:i-l))Pi(v B , /[W-l]) 
Pi-l^A^U, /[l:i-l])Pi-l(«fl, 
P<-l(«A,Ufl,/[l:i-l]), 
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where in (a) we have used the fact that since i is odd, the induction hypothesis quarantines 
that Va — Vb, i^i-i-i] — Fi for the distribution at step i — 1, i.e. Pi-\. 

Now we prove the correctness of (48b). If p < i, according to (48a), the marginal distribution 
of va,vb, f[i-. p ] is unchanged and the statement is true based on the induction hypothesis. Now 
assume p > i, we have, 

Pi(v A \vB, f[l:p-l], fp) = Pi-1(VA\VB, fp) 

= Pi-l(v A \vB, f[V.p-l]) ( 52 ) 
= Pi(v A \v B ,fll:p-l]), 

where (a) is true since fi is present in the condition, (b) is true since the Markov chain holds 
for Pi-\. If p > i, fi is present in the condition in (b) showing the correctness of (c), otherwise 
all the random variables have indices less than i and hence using (48a) we realize that (c) is 
correct. The proof of (48c) and (48d) are the same. 

Therefor we have proved that Ai is the set of vectors {pi(fi)} i n ^'" Fi ' satisfying (49). 
We can rewrite the conditions in (49) as, 

^Pi(fi)Pi-i(v B , = Pi-i(v B , V^b G V, fj G < i, (53a) 

fi 

Pi(fi)>0 V/iGJi, (53b) 

which is a set of \J~i\ + ki where ki = \V\ YYjLi l-^j'l linear inequalities. Therefore is a 
polytope in M'^L The vertexes of this polytope have at most k nonzero element, representing 
a distribution of cardinality at most ki which is desired. It remains to prove that at least for 
one of these distributions, $ > g^i, note that using the definition of (46), 

gi (F [1:rh D) = X 3 H(Va\V b ) + \4H(V B \V A ) w(/i), (54) 

fi 

where 

c fi = AiE [Ga (D, Va\\V b ) \Fi = fi] + A 2 E [Q B (D, V A \\V B ) \F t = f % ] 

-\zH{V A \V B ,F r \\Fi = fi) (55) 
-X^HiVslV^F^^F^fi), 

note that since, conditioned on fi the distribution is the same as and is unchanged, there- 
fore the terms Cf t and \zH(Va\Vb) + MH(Vb\Va) are constants and gi is a linear function of 
{Pi(fi)}fieTi plus a constant. We know the value of gi at {pj_i(/j)}/ i£ jr. G is a linear combi- 
nation of its values on the vertexes. Hence, at least for one vertex of Ai, the value of is not 
less than 

The case of i even is exactly the same and the proof is complete. □ 

B Proofs for Adjusted Winner 

B.l Deriving Adjusted Winner formulation for two goods 

When there are only two goods, by changing their ordering, we realize that AW (a, b), which 
is vector of size 2, is the reverse of AW (1 — a, 1 — b). Therefore it suffices to analyze the case 
when b < 1/2. We will take three cases: 

Case I, < a < b: Since the valuation of Bob is more than Alice in the first good and the 
valuation of Alice is more in the second good, the initial allocation is 





df ' 




' 


i " 


4 






i 
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where Alice's gain is 1 — a and Bob's is b. Since Alice's gain is more, a portion of the second 
good should be given to Bob. Solving the equations, the final allocation would be, 



i 



l-a- 



2-a-b 2-a-b 



It should be noted that in the case of a = b, there is no unique allocation, since in that case 
the initial allocation is giving all the goods to Alice, but we can start with either the first good 
to give to Bob or the second one, therefore any of the following allocations is feasible, 



1 

1-26 
2-26 2-26 





1 



1 

1- 26 

2- 26 2-26 




1 



which give us exactly the same gain. We note that we would get the second allocation instead 
of the first if we took the case of a = b in Case II (discussed below). Therefore the AW function 
is not well defined when the valuations are identical. 
Case II, b < a < 1 — b: the initial allocation is 

1 
1 

where Bob's gain is 1 — b which is greater than that of Alice which is a, therefore a portion 
of the second good should be given to Alice. Solving for equality we get the following final 
allocation 

1 

l-a-6 1 



L 2-a-6 2-a-6 

Case III, 1 — b < a < 1: the initial allocation is 

10 
1 

where Alice's gain is a which is greater than that of Bob which is 1 — b, therefore a portion of 
the first good should be given to Bob. Solving for equality, the final allocation would be 

JL i _ JL 

a+b a+b 

1 

When b > 1/2, by considering AW (1 — a, 1 — b) and reversing the answer, we can find the 
allocation in general: 



AW (a, b) 



f(0.5zb) 
(1-^1) 

1(0- to) 



a<bAb< 1/2, 
b <a<l-b Ab< 1/2, 
l-b<a<lAb< 1/2, 
Ka<lA&> 1/2, 
l-b<a<bAb> 1/2, 
a < 1 - b A b > 1/2. 



(56) 



Taking the similar terms together and neglecting the cases when a = b which is not well defined 
as discussed before, we get the following simplified formulation: 



f(° 



AW (a, b) 



< a < min(l — b,b), 
max(l — b, b) < a < 1, 
(l,§EfEf) b < a < 1 -bAb < 1/2, 
k (1-^,1) l-b<a<bAb> 1/2. 



1 ) 

2-a-b) 



(57) 
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Note that as discussed before, the special case when a = b does not result in a unique division, 
and we have taken one of the possible cases. However, as we will see later, the case of a = b is 
not interesting for us, therefore this conflict is acceptable for the purpose of our study. 

An interesting fact is that, the four above cases are not independent. In fact the two 
following equalities (which are true, even when m > 2) relate these four cases: 

AW (1 -(a,b)) = AW (a,b) r , 

AW((a,6) r ) = l-AW(o,6), 1 ' 

where the reverse operator acts as (a, (3) r = ((3, a). Note that these are simply the case where 
the ordering of players or the placement of items are altered. 



B.2 General uniform case 

First we prove some tools. First we start by the following observation regarding the \&* func- 
tion. The optimal value of a when Alice knows Bob's valuation has been analyzed formerly, a 
discussion could be found in [1]. 

Proposition 1. Assume a and b are fixed. Then *ff(a,a\\b) is a concave function of a if 
min(6, 1 — b) < a < max(6, 1 — b) and convex when a < min(6, 1 — b) or a > max(6, 1 — b). Also 
it is increasing when a < b and decreasing when a > b. Furthermore, 

\im x ^ b + \1> (x, a\\b) a > b, 
sup* (5, a\\b) = { linv+ft- ^ (x, a\\b) a < b, (59) 
*(6,6||6) a = b. 

In fact this shows that if a > b, the optimal value of a is b + = b + e, and when a < b, the 
optimal value is a = b~ = b — e. In fact in these two cases * does not have a maximum. It 
should be noted that the AW function is not well defined when a = b and a ^ b. 

Proof. First we give the exact formulation of * using (12) and Definition 5, 



* (a,a\\b) 



a+b 

First assume b < 1/2. In this case, 



2^ 0<a<min(l-M), 
gfpjj max(l — b,b) < a < 1, 

a + (1 — a)|5§5f b < a < 1 - b A b < 1/2, 
1 -b < a < bAb > 1/2. 



(60) 



9 -^r < a < b, 

2—a—b — — ' 

*(a,o||6) = { a + (l -a)iE§Ef b < a < 1 - b, (61) 

.A a 

which is increasing in a < b, decreasing mb < a < 1 — b and 1 — b < a, also the limit of the 
second case when a goes to 1 — b from left is equal to a which is equal to the value of the third 
case for 5 = 1 — 6. Therefore the function is continuous everywhere expect possibly in b. The 
left and right limits at b are (1 — a) /(2 — 2b) and (1 + a — 2b)/ (2 — 26) respectively. We see that 
the left limit is greater when a < b, they are equal when a = b and the right limit is greater 
when b < a, which shows (59) in this special case. The concave/convex statements are evident 
from the expression. 



Now assume b > 1/2, we have, 



< a < 1 

2—a—b — — 



tf(5,a||6) = <l-j£t l-6<5<6, (62) 



d+b 

=ze 6 < a < i, 

a+b — — ' 
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which is increasing in < a < 1 — b and 1 — b < a < b and decreasing in b < a. The limit of the 
second case and the value of the first case are both equal to 1 — a at a = 1 — b, therefore the 
function is equal at that point. The left and right limits at b are 1 — a /2b and a /2b respectively, 
therefore left limit is greater when b > a, the right limit is greater when b < a and they are 
equal when a = b, which again verifies (59). Again, the concave/convex statement are evident 
from the expression. □ 

Using this Proposition, we can conclude the following statement which justifies Definition 7. 
Corollary 3. The optimum value for a for ^> (a, a\\b m i n ) b max falls in [b m i n , b max ], i.e. 

max * (a, a\\b min , b max ) = max ^ (a, a\\b min , b max ) . 

0<a<l b min <a<b max 

Proof. Assume a ^ [»mm)°max]- First assume a < o min . As we have shown in Proposition 1, 
^ (a, a\\b) is increasing in [a, b) for all b G [o min , o max ]. Therefore, 

* (a,a||o min ,o max ) = - — - — / V(a,a\\b)db 

"max "min Jb m i n 
<T 7— / ^(&min,a||6)d6 1 ' 



h — h ■ 
"max "min J b min 

\l/ (o mm , a||o mm , ^max) , 

hence the maximum can not happen at this a. The proof for the case where a > o max is similar 
using the fact that \l/ (a, a\\b) is decreasing in (o, a] for all b G [bmin , b m3X \. □ 

Now we analyze the behavior of the sequence of maximum improvements by asking an 
specific number of questions, A* k . We expect that by asking a number of questions, the expected 
gain for Alice increases, and the more questions she asks, the more is this improvement. The 
following proposition establishes this. 

Proposition 2. Assume a is fixed and b is uniformly distributed in [b m i n ,b max \. Then the 
sequence A* k (b min ,b max ) for k > 1 is nonnegative, nondecreasing and bounded by 1, i.e. 

< AJ < A* < ••• < 1. (64) 

Proof. First we prove that for all o min < b < b 2 < o max and b < bi < b 2 , A x (b ,bi,b 2 ) > 0. 
Assume that a is the announced valuation that maximizes \l/ (a, a||on, b 2 ), we have, 

Ai (&o, h, b 2 ) = ^-^** (a||6o, h) + ^— (a\\b h b 2 ) 

\ - b 

- T — & a \\bo, h) + — r 1 * (a, a||6i, b 2 ) 

Oo — On 



h- 


bo. 


b 2 - 


bo 


- 


(a 


h- 


bo. 


b 2 - 


bo 




a, 


1 





OO, 2 J 

1 * (a, a\\b) db + - 1 - [ * fa, alio) db 



(65) 



bo Jb b 2 — b j bl 

b 2 

* (a, alio) db 



b 2 ~ b j bo 
= 0. 

Maximizing over b±, A* (on,o 2 ) > 0, and also substituting bo = o m ; n and 6 2 = 6 max , we realize 
that A* (6 min , 6 max ) > 0. Using this, we will show that A* k (6 min , 6 max ) > A^_ x (o min , 6 max ). 
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Assume 6 m i n = bo < ■ ■ ■ < b 2 k = 6 max are arbitrary division points, we have 
A* k (b , b 2 k) > A k (b , h, ... , b 2 k) 

- A fc _i (6 , 62, • • • , b 2 k) + A fc _i (6 , 6 2 , • • • , b 2 k) 

Ebi — bi_ 2 1 bi-i — 6j_ 2 



i=l 
i even 



(66) 



+ A fc _! (60,62, • • • ,6; 



2* , 



» — \^Ai (6i_ 2 , fti-i, &<) + A fe _i (6 , 62, 
^ 6 2 fe - 6 



i=l 
i even 



> A fe _i (6 ,6 2 , .. .,b 2 k) . 

Maximizing over 6 2 , 64, ... , 6 2 fc_ 2 we conclude that A* k (60, 6 2 fc) > A^_ x (6 , 6 2 *). 

Note that since gains are all bounded from above by 1, the \I/ functions are bounded from 
above by 1. Thus the maximum of their expected values, \J r * are bounded from above by 1. 
Hence, by definition, A* k < 1. □ 

Now we have sufficient tools to prove Theorems 2 and 3. 

Proof of Theorem 2. We prove this by induction. In fact we prove a stronger statement; we 
claim that for all k > 1 and 60, 6 2 fc such that 6 m i n < &o < 6 2 fc < 6 m ax, 

A* k (bo,b 2 k)<kA(bo,b 2 k), (67) 

which reduces to what we expect by substituting 6 = 6min and 6 2 fc — 6 max . Note that for k — 1 
this reduces to (21) which is assumed to be true. Now assume it is true for k — 1. If 6 1; . . . , 6 2 fc -i 
are the divisions which maximize A£ (6 , 62*), we have: 

Afe (6 , b 2 k) = A k (b , bi, ... , b 2 k) 
2 k 

~ b 2 k - b 

1=1 

b 2 k-i — bo . , u , v 6 2 fe — 6 2 fc-i /, \ 

= — — A fc _i (o , ... , b 2 k-i ) H — A fc _i (02*- 1 , • • • , 6 2 fc J 

o 2 k — bo b 2 k — b 

b 2 k — b b 2 k — bo 

(a||6o,6a*) 

b 2 k-i — 60 . , . 6 2 fc — b 2 k-i , . 

= — — A fc _! (6 , . . . , b 2 k-i ) H — A fc _! (b 2 k-i ,...,b 2 k) 

b 2 k — bo b 2 k — bo 

+ Ai (b ,b 2 k-i,b 2 k) 

6 2 fc-i-6o A * /, , \ 1 b 2 k-b 2k -i 

< -7 jr A k-i (00, fo 2fc-i) + —r J— A k-i (b 2 k-i,b 2 k) 

b 2 k — bo b 2 k — bo 

+ A\(bo,b 2 k). 
Now by using the induction hypothesis 

Al(bo,b 2k ) <(*-!) (^^A(6 ,6 2 ,-0 + b -^^A(b 2 k- 1 ,b 2 k)) 

\ b 2k — b b 2 k —b J (69) 

+ A(b ,b 2 k). 
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(68) 



Since A is interval concave, 

A* k (b ,b 2k )<kA(b ,b 2 k)- (70) 

□ 

Proof of Theorem 3. We know from Proposition 2 that A* is bounded, therefore V is continuous 
at x — y. Furthermore, since A^ is differentiable with respect to y, for a fixed x, it is continuous 
with respect to y. Therefore for b > a, T(a,y) is continuous when y changes in [a, b] and 
differentiable in (a, b) as A^ is. Using mean value theorem, there exists a y* 6 (a, b) where 

r(a,b) = (b-a)j-T(a,y*). (71) 

Now by the definition of A, 

d 

(b - a)A* (a, b) = T(a, b) = (b - a )^ r K V*) < (& ~ a)A{a, b), (72) 

hence for any a < b, A^ (a, b) < A (a, b) and therefore A is an upper bound on A^. 
It only remains to prove that it is interval concave. Note that if x < t < y, then 

d d 
A(x,t) = max — r(7x,7 2 )< max — T^, j 2 ) = A(a;, y), (73) 

x<7i<72<t (7?/ 2<71<72<S/ (7?/ 



likewise 



d d 
&(t,y) = max — r(7i,7 3 )< max — r^, 72) = A(z, y), (74) 

*<7l<72<2/ (7?/ ^<71<72<S/ ay 

therefore, 

— A(x, t) + ^A(t, y) < A(x, y), (75) 
y — x y — x 

which shows that A is interval concave. □ 



B.3 Special uniform case 

First we prove some tools. In this special case when 1/2 > 6 mm , the integral in (14) could be 
computed and the following properties could be easily derived by taking the first and second 
derivatives. 

Lemma 2. If 1/2 < b m i n , then for b m i n < d < b max we have, 



a l0 S ( (a+b ma t)(a+b mm ) ) a + ^ax 



W (a, a\\b min , b max ) = — , (76) 

"max ®min 

is concave in a, therefore it has a unique maximum. Furthermore if a > T u (b m i n ,b max ) then 
the derivative is positive inside the interval and therefore the maximum happens at b max and 
if a < T i(b m i n ,b max ) the derivative is negative inside the interval and therefore the maximum 
happens at b min . 

Proof. Using the expressions in (59), we have 

6„ 



* (5, a||6 min , 6 max ) = — — / * (o, o||6) db 

"max "min Jft m ; n 



dx+ I : dx ( 77 ) 



bma,x b mm \Jb- a ~\~ x J g \ d -\- x 

0- log ( jt—t — 4 ff- . r r ) — CL + 6 n 

^max ^min 
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Omitting the linear terms, we need to show that log ( j^-t — f, - , , — r ) is concave in a, the second 
derivative is equal to 



1 1 2 

+ 



r,2 



(Km + a) 2 (6 max + a) 2 a 

1 1 \ / 1 1 , 

+ T, ^To - ^ < 0, 



(&min + a) 2 a 2 / \(b mSLX + a) 2 a 2 



(78) 



which shows the concavity. 

Now assume that a > r u . Since the function is concave, it suffices to show that the derivative 
is positive at a = 6 max . The first derivative is equal to, 



max ) ~ 2a6 m ax a 

a(f>min-&max) a+^min 



a + 6 n 

Substituting a = 6 max , 



(79) 



26 max (6 max "I" ^min) + "(^max + 36 mm ) 

26 lb 2 - b 2 ) ' ^ ^ 

^"max^max u mm) 

Note that the denominator is positive since 6 max > 6 min > 0, therefore expression is greater 
than or equal to zero if and only if a > r u . For the second case, again since the function is 
concave, in order to show that the maximum happens at 6 mm , it suffices to check the derivative 
at a = 6 min which is equal to, 



fl(36 max -|- 6 m ; n ) 26 max 6 mm 26 m - n (Q1\ 

2b ■ (b 2 - b 2 ) ' ' ' 

Again since the denominator is positive, the first derivative is less than or equal to zero if and 
only if a < T\. □ 

Lemma 3. (a) If 1/2 < b m i n < b max < 1, the thresholds T\ and r u satisfy 

n < b min < b max < t u , (82) 

(b) If [b , h] is a subinterval of [b min , b max ], i.e. b min < b and b\ < b max , then 

n(b min , b max ) < rj(6o, 6i) < r u (b , 6i) < r u (b min , b max ). (83) 

Proof. If s denotes the ratio of endpoints, 6 max /6 min , we see that, 

, 2(s + l) 



'max , o 

s + 3 



n = b - 2 -^±V 

1 1 "mm „ 1 • 

3s + 1 



(84) 



For part (a), note that 2(s+l)/(s + 3) > 1 for s > 1. Since s = 6 max /6 min > 1 and t u > 6 max . 
Similarly, for s > 1, 2(s + l)/(3s + 1) < 1 which shows that r\ < 6 mm . 

For the second part, if s' denotes 61/60, we have 61 < 6 max and s' < s, therefore (84) and the 
fact that the function 2(s + l)/(s + 3) is increasing show that r u (60, 61) < r„(6 mm , 6 max ). Simi- 
larly, since 6 min < 6 and the function 2(s+l) / (3s+l) is decreasing, T;(60, 61) > Ti(b min , 6 max ) □ 

The following Lemma gives a simple expression for ty* in this special case. 

Lemma 4. With conditions of Theorem 4, we have the following formulation for ty* , 

** (a\\b mm , b max ) = { tt . War n G - Tn ' (85) 

M b ™*+"™J +1 a < r { . 



alogi t_ 

v Q7nax+o m i n 



'-'max u rntn 



31 



Proof. As we have shown before, ^> (a, a||fe mm , 6 max ) is differentiable and concave in a, therefore 
its maximum value either happens at endpoints or could be obtained by setting its derivative 
equal to zero. However, since the function is concave, the maximum happens at 6 max if and 
only if the derivative is nonnegative entirely in the interval, which reduces to the condition that 
the derivative is nonnegative at 6 max . Simplifying this condition, we realize that this happens 
when a > r u , therefore substituting a = 6 max we get the expression for the first case. Using a 
similar method and by setting the derivative at b m i Q to be less than or equal to zero, we get the 
second case. □ 

In the next Lemma, we derive the exact form of A£. 

Lemma 5. For a and [b m i n , b max ] fixed, the geometric sequence bo, ■ ■ ■ , b 2 k where bo = b m i n , 
b\k = b mn!r . and 



maximizes A* k (b m i n , b„ 



log&,= (l--llog& + ^;log& 2fc Kt<2 k , (86) 



Proof. We will take two cases, a > t u or a < T\. First assume that a > r u . Using Lemma 4 we 
have, 

A k (b , ...,b 2 k) = J2 h r\ 1 ^* H h i-^ h *) - («ll & o, b 2 k) 
~f 2 k — Oo 

«iog /2 ""' 1 (87) 



b 2 k — bo b 2 k — bo 



where the last equality holds since is a subinterval of [& mm , b mSiX \ and hence using 

Lemma 3, a > r u (&j_i, b ). Note that a, b and b 2 k are constant, therefore by defining Sj = foj/foj-i 
we should maximize the following, 



(88) 

i=l 



Since log is increasing, in order to maximize this, we need to minimize a where, 

n (i + tY < 89 > 



a 

8=1 



If we define Sj = since % is a geometric sequence, 

(90) 



b \ 1/2fc 



'max 



brail 

Now define pi = logs« and pi = logSj. Note that pi is a constant sequence. In fact, since 
Y\si = JJsi = &max/&min, 12 Pi = 12 Pi = log &max - log& min . Also h is geometric, hence for all 
1 < j < 2 k 

. _ log ftmax - log ^min _ EjU Pi / Q1 \ 

Pi ~ ok ~ ok ■ 
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Now, by defining f(x) = log(l + e x ) which is convex, 

log a = ^ log I 1 + — 

;=i \ s ^ 

i=i 

2 fc 




(92) 



i=i 

where (a) uses Jensen's inequality and the fact that f(x) is convex, (6) uses (91) and (c) uses 
the fact that is a constant sequence. Thus hi minimizes a or equivalently maximizes 
Now considet he case where a < Tj. In this case we have, 

A fc (6 , . . . , b 2k ) = V ^"V ** (aH^x, &<) , -vl/* (a||6o, M 

o 2 k — o 

26o 



» 2 fc +&0 



8=1 



where again we have used Lemma 3 which guarantees that a < Tj(&j_i,&j). Omitting the 
constant terms, we should maximize 

Since log is increasing, we should minimize (3 = Yll=i 1 + s «- By defining g(x) = f{—x) = 
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log (1 + e x ) which is convex, we have 

2 k 

log/3 = ^log(l + s 4 ) 

i=i 

2 fc 

= $^log(l + exp(pi)) 
i=i 

2 k 



i=1 (95) 

2fe 



2 k 

*2 k 



w k n ( E i= iP 



2 A ' 



2 fc 

, „ t ,, ;! . 

i=l 



where (a) uses Jensen's inequality and convexity of g>, 6 uses (91) and (c) uses the fact that pi 
is a constant sequence. Thus 6, minimizes /3 or equivalently maximizes A^. □ 



Remark 3. Note that this lemma shows that the optimal series of divisions for k questions is 
exactly the same for that of k — 1 questions together with the optimal dividing question for each 
of the 2 k ~ 1 subintervals. 

Lemma 6. If a £ [Ti(b m i n ,b max ),T u (b min ,b max )}, then A\ is interval concave. 

Proof. Assume b > 6 min and b 2 < & max , as a result of Lemma 3 part (b), a [ri(b , b 2 ), T u (b , b 2 )]. 
We can derive the formulation for A2. Using Lemma 4, for the case of a < Tf. 

2 7 7 

Ai (6 , 61, 62) = E 4 JT~^* ( a \\b*-i, h) ~ ^ (a\\b , b 2 ) 



2 



v " \o, ; +Oi_i y " y 02+00 y (96) 

r-/ &2 - &2 - &0 

2=1 



a log 



2bi (02+00) 
(oi+6 2 )(6i+6 ) 



62 - 



Similarly, for the case of a > t u 



2 b - b 

Ai (60, &i, 62) = J2 i r 1 ** ( a \\bi-i, bi) ~ (a\\bo, b 2 ) 

b 2 -b 



2 



v u \6i+0i_i y u ^02+00 y (97) 

^ &2 " &0 62 - ^0 



a log 



26i(6 2 +6 ) 
(61+62) (61 +6 ) 



62 - 

We observe that Ai {bo,bi,b 2 ) is the same in the two cases. Using Lemma 5 and substituting 

h = a/&0&2, 



a log 



2(62+60) 
(V6HV60")" 



A^ (6 , b 2 ) = " 7 7 7 ■ (98) 

02 — o 
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1.1 1.2 1.3 1.4 



Figure 15: The plot of f(s) as defined in (100) for 1 < s < V2 



By defining s = y^Jbo, the ratio of interval endpoints, we can rewrite A^ in the following 
form, 

Oo s — 1 

We show that if b < b x < b 2 , then A^ (6 A) < A^ (6 , b 2 ) and A^ (&i,6 2 ) < A^ (b ,b 2 ) 
which is sufficient for a function to be interval concave. Note that the first term in (99), a/bo, 
is decreasing in b , thus it suffices to show that f(s) defined as, 



log 



/(*) = — i ' , (ioo) 

is increasing in s when 1 < s < \/2 (s is the square root of the ratio of 6 max and b min and hence 
is greater than 1, also 6 m ax/^max is at most 2, since 6 min > 1/2 and 6 max < 1). Monotonicity of 
/ could be shown analytically. Its plot is provided in Figure 15. □ 

Now we have sufficient tools to prove Theorem 4. 

Proof of Theorem 4- For the sake of simplicity, we use A* k to denote A* k (6 m i n , b mayL ). For proving 
the concavity of the sequence, it suffices to prove that for k > 2, 

A* - AU > AU - AU, (101) 

where Aq is defined to be 0. Assume bo, . . . , b 2 k is the sequence given by Lemma 5 which 
maximize Note that since the sequence is geometric, the sequence foj, < % < 2 k , i = 
(mod 2) is the sequence maximizing Afc_i and also the sequence b-i, < i < 2 k , i = (mod 4) 
maximizes A^_ 2 - Hence, 

2 fc 

A * = E T— bj 7T^ b ^ - ( fl ll & o> *») » 

i=1 

= E |r=^ r ( fl ii 6 - 2 ' 6j ) - ** ( a ii 6 °' ' (102) 

2|i 

^-2 = E 7T— V** ( fl H 6 - 4 ' 6 ^ " ( a H 6 °> 6 ^ • 

~7 °2 fe — °0 
4|i 
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Note that when k = 2 the last equality reduces to Aq = which is consistent with our definition. 
Subtracting A^_ x from A* k and simplifying, 



2 

bi - fev_5 



i=l 

2\i 

■jfe 



2 fc — Oq 



i = i 
2|i 

9* 



b 2 k - b b 2 k - b 

f-* 2 * - b Q V »i - Oi-4 &i - 



»=1 
4\i 

-,k 



2 



(103) 



i— 1 

2|i 

where we have used the fact that since bi is geometric, = a/&A^2- Similarly, 

"~ °2 fc — °0 

1 — 1 

4|i 

Now, 

2 fc 

&i - 6i. 



(105) 



where we have used the fact from Lemma 6 that A^ is interval concave. □ 



C Proofs for Maximizing Nash Collective Utility 



Proof of Theorem 5. As we have already discussed, Wp» > Wp, and it remains to prove the 
other side. First we assume that t = 2 and then using induction and chain rule we will show 
the general case of t > 2. 

Note that maximizing W-p is equal to maximizing 



Similarly, 



- log W v = max E ~ E l °s( h l v ij) = max E a ~ E lo g( b i v *j) 

i j=l 

= max V«iE [log(b^Vi)] 



bi:fc ' Tlj . 

i j=l 



- log W-p/ = max ( a ia E log(b' 1 * a V la 



n 



b ia> b 'l6> b 2:fc 



log(b£v 16 ) 



+ E ^ E [log(b?V0] ) 



i=2:fe 
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Let bi-.k optimizes the first expression ^logW-p, and b' la ,b' lb ,h' 2 . k optimizes ^logW-pr. We 
have 

Elog(b? a V la ) = £>(Via = vi a )tog(b£vi a ), 

Via 

Elog(b;* 6 V 16 ) = $>( V i& = v 16 ) log(b? 6 vi 6 ), 

Vi6 

Elog(biVi) = J>( v i = vi)log(b' lVl ) 

vi 

= E ( a ^P( v ia = vi) + a lb p{V lb = v^) log(b*Vi) 

vi 

=&la 5^P( V 1« = V l«) lo g( b l V la) + "16 X^( Vl6 = Vl& ) l0 S( b l V l&)- 

Via Vlj, 

Therefore 

ai„Elog(b? Vi a ) + a 16 Elog(b;* 6 V 16 ) - Elog(b*V 1 ) = 



X>( V ^ = v la ) log j + X>( V " = v lb ) log j . (106) 



Simplifying the first expression, 

E/,r M f b '/a V la \ M [ b '/a V la P( Vl = V la )p( V la = V la ) 
P(Vi = Vi a ) log Tl = > P(Vi a = V la log — — — — 

V ^ J Vl^la V ^ = = Vl «) 

/, r m /" b '/a V la P(Vi = Vi a ) 

V p(Vi a = Via) log -p — 

^ 1 b lVla P(Vi a = Vi„) 



VlaGVla 



/ A r M / P( V la = Via) 

V V (Vi = V^ 



VlaGVl 



^ 1 f f\7 ^ b la Vi a g(Vi = Via) 

< log > p(Vio = Vi )-p 7^ r 

Vvi^la ^ = Vl «) 

+ L>(p(v la )|b(vi)) 
= log( E MV 1 = v la ) b ^)+D(p(v la )|b(v 1 )) 

VviaGVla DlVla / 

< log ( P^ = Vl )^ I + ^(P(via)lb(vi)). 

VvieVi lVl / 



Similarly, 



J]p(V lb = v lfe )log (jjj^j <log (E^ V i = v i)i|^) +£>(p(v u )||p(v 1 )). 



Therefore 



«ia V la = v la ) log (l^) + «» E P(V 16 = v 16 ) log j 

< aialog (E^i = v ^^) + «- lo g (Ep( V i = ^f^) 
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+ ai a D(p(vi a )\\p(vi)) + a lb D(p(vi b )\\p(vx)) 
= aia- 

+ « 1 /(T; Vx) 
hence equation (106) implies that 



log f^XVi = Vl )^) + a u log(^p(V i = vi)§^; 



a la Elog(b'/ a V la ) + a 16 Elog(b'/ 6 V 16 ) -Elog(b*V 



< ax.bg ^pCVx = vx)^ + a lfc log (j>(Vi = vx)^ 



+ ax/(T; Vx), 



thus, 



-logWp, --logW v < a la \og Vp^ = Vx)^ ) + a lb log ( WVx = v x )^i ) 
n n ^ b lVl J \^ b lVl y 

< log f a la J2p(Vi = vi)^ + an 5>(Vi = Vx)^ 

\ VI 1 VI 1 



i=2:k v, b * Vi 



< + ai/(T;V 



(107) 



where in the last step we have used A < 1 where, 



A = a^PWi = vi)^ + a 16 5>(Vx = v x )^ + £ p(V< = v.) 



vi " 1 ' ' vi "i-i i = 2:fe Vj 



biVi bnVi ^ / — ' b-Vj 



We need to show that A < 1 but if we accept it for now, we can raise both sides of equation (107) 
to the power 2 and use the fact that na.i = rii to complete the proof for t = 2. 

We now show A < 1. Letting bx = ^* b' la + ^b 1{l and bj = b'j for z > 1, we can rewrite A 
in the following form: 



,=1* v, b < V ' 



Observe that 



njbj = n «jbj = n | c^bx + a^bj J 

i=X:fe i=l:k \ i=2:k J 

= n ax ( — b' la + — b' 16 ) + V aib; 
= n I «xabi a + ai&b' 16 + ^ J =1, 



therefore using Lemma 7 (below) we can conclude that A < 1. 
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We use induction to prove the general case of t > 2. Dividing V\ into t sub-clusters 
recursively by first dividing it into two sub-clusters Vi jQ = (Pi,i, • • • j^i.t-i) and Vim = Vt- If 
we define T to be 1 when a randomly chosen individual from cluster V\ belongs to cluster P l a 
and 2 when he belongs to V\Mi using induction hypothesis we have for the division in two steps, 

W T , < w v 2 niI{Vl ' T) 2 niaI{Vla ' Ea \ (108) 

where n la denotes the number of individuals in cluster V\ >a ,V la denotes the random variable 
indicating their valuations and E a is a random variable in {1, ... ,t — 1} for the cluster numbers 
in the second step. Note that T is a function of E, in fact 

fi E e {l,...,t- 1}, , , 

T = < J ' (109) 

[2 E — t, 

therefore 

/(v^hkv^.t) 

= I(V 1 ;T) + I(V 1 ;E\T), 

expanding the second term we have, 

J(Vi; E\T) = 7(V i; 77|T = l)p(T = 1) + 7(Vi; £|T = 2)p(T = 2). (Ill) 

Conditioned on T = 2, according to (109), E = t constant and therefore 7(Vi; E\T = 2) = 0. 
Conditioned on T = 1, Vi and E reduce to Vi a and E a respectively, therefore I(V\;E\T = 
1) = 7(Vi a ; Eg). Also T = 1 indicates being in the cluster P la , therefore p(T = 1) = n la /rii. 
Substituting this into (110) and comparing with (108) we have, 

W v > < W v 2 niI ^ Vl ' E \ 

which completes the proof. □ 

Lemma 7. With the above notations, if bi for i — 1 : k maximizes W-p and bi for i — 1 : k is 
such that ^2i=i n i^i = ^' ihen 

£*£KVi = i%)^<i. 

i=l:k Vi * 1 

Proof. We first prove that the function 

/(xi, . . . , Xfc) = £ a i £ p ^ Yi = Vi ) log ( x ^ Vi ) ' 

i=l:k Vi 

defined on division vectors x 1; ...,Xfc such that n Yli=i-k a * x « = 1 is concave. If we define 
the set of such divisions by B, the set B will be a convex set since if (x 1; . . . ,x fc ) 6 B and 
(y 1; . . . , y fc ) e B, the division (zi, . . . , z*) where z« = Axj + (1 — A)y^ where < A < 1 also 
satisfies n X)i=i-fc a i z i = 1- We claim that / is a concave function. If x, y and z are as defined 
above, we have 

/(zi, . . . , z fc ) = £ «i £ p ( Vi = v ^ log (( Axi + ( X ~ A )^)' Vi ) 

= £ a * £^( v * = v *) lo s ( Ax ' v * + i 1 - A )yN 

i=l:k Vi 

< A/(x 1 ,...,x fe ) + (l-A)/(y 1 ,...,y fc ), 
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where we have used the concavity of logarithm. Therefore since bj maximizes this function, its 
derivative in the direction of any other division vector in B such as b should be negative. If we 
write bi x = (1 — A)b, + Ab;, we should have 



lim^(/(b 1 , A ,...,b M )-/(b 1 ,...,b fe ))<0, 



if we simplify the limit, 



S a 2> £ p{ v * = Vj) log fe^ — 

i=l:fe Vj \ 1 

i=l:k Vj \ 

= 5>£kv ( =v ( ) f|^-i) 

i=l:Jfc v, \ U i V ' / 



(112) 



i=l:k Vj 

setting this less than or equal to zero results in the statement we wanted to prove. □ 
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